CDS

Accession Number TCMCG064C35138
gbkey CDS
Protein Id XP_020547332.1
Location complement(join(10209..10301,11125..11964,12158..12238,12349..12438,12638..12759,13134..13236,13354..13530,13861..13936,14227..14311,14414..14611,18542..18578))
Gene LOC105180372
GeneID 105180372
Organism Sesamum indicum
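
The Location field above uses the GenBank feature-location syntax: `join(...)` lists the exon segments with 1-based inclusive coordinates, and `complement(...)` places the feature on the reverse strand. As a minimal sketch (the helper name `parse_location` is hypothetical and not part of any database tooling), the segments can be extracted and their combined length compared with the protein length reported in this record:

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(10209..10301,11125..11964,12158..12238,12349..12438,"
    "12638..12759,13134..13236,13354..13530,13861..13936,14227..14311,"
    "14414..14611,18542..18578))"
)

def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple GenBank location.

    Hypothetical helper: handles only plain 'start..end' segments,
    optionally wrapped in join(...) and/or complement(...).
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments

strand, segments = parse_location(LOCATION)

# Coordinates are inclusive, so each segment spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

print(strand, len(segments), cds_length)
```

The 11 segments sum to 1902 nt, which equals 3 × (633 + 1): 633 codons for the protein plus one stop codon.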

Protein

Length 633 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_020691673.1
Definition uncharacterized protein LOC105180372 isoform X1 [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category D
Description PP-loop family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R09597
KEGG_rclass RC02633, RC02634
BRITE ko00000, ko01000, ko03016
KEGG_ko ko:K04075
EC 6.3.4.19
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGCGATGGCTGGCCTCAAACCCCATCACCACATCGCAATGGCTGTTTCTGGTGGTCCTGATAGTATCGCGCTCTGTATATTAGCTGCTGGTTGGAAATCTAATGACTTTGATGCTGCTGCCAATAGAAGAAACAAGTTCATTGACGGTCTTTTAGCAATTGTTGTGGATCATGGATTACGTAAGGAAAGTGCGGAGGAAGCAAACCTTGTTTACCAGCGAATTACAGATATGGGAATCAAATGTGAAGTTGCTCGCTGTGAATGGTTGGATGGTAGACCCAAGGTTGGTCACTTGCAAGAAGCAGCTCGCAATAAAAGGTATCAAACTTTACAAAACATATGTAGCCAGCTGCAGATTGGGATATTGTTAACCGCACATCATGCTGATGACCAGGCGGAGCTATTCATTCTCAGATTATCTAGAAATAGTGGCATTCTTGGTCTTGCTGGCATGGCATTTACTTCTCAAATGTTCCCGGAATTTCCTGATATTAGAGGAGAAGGATCAAAAGCTCATGGCATTATATTGGTCAGACCACTTCTAGAGTTCTCAAAAGAAGATATGTACAATATTTGTCAAGCTGGTTATAAGAAATGGGTGGAGGATCCAACAAATAGGAGTCCCTTGTATGCTCGCAATAGGATCAGAATGTCACTGTTTAACCTGTCATCACCTGTTTTCAAGGCTGAGCTGCAAGCAGTCATATCAGCATGTCGAAGAACGCGATTGCATGTTGACAATGTTTGTCGCCTGCTGCTGAATCAGGCTGTGACCGTCATGCCTGTATGCTCTTCACATGGATATGCAGTGATTGATCTGGGAAATCTCCATGCAATGGAGGTCAAGGACATTTACCTTGCAAAGTTTGCTGCTATGGTATTGCAGTTCATCTCACAAAGACATAGGCCTGTTCGAGGGAATGCATCAAAATTGCTACTAAGCTACCTTCGTACTTTTCCATGCAAGACTTGTCTCACTGTAGCCAGCTGTTACCTATGTCCAGCACCAGGATCCAAAGGAACTCAGGTCCTGGTGTGCTGTTCTGTTAATTCATCTTTGCCTCCTATGGTAAAGTTGTTTCATGGATGCTCATATGTACGGGAAAATTGCTTTGCTAAAAGCGAGTTAGAGCAGATTATAAAAGAGAGTGAAGCATATTTAAATAGACTCCTACCAGATGCTTCAAGCGTTCCATTTTTGGATATGGCATCCTCTGAGTCCGTTCTAACTGAAGCCAAAAAATGTGGCATTCTTAGTCACTGTACCCATAGGAGTATTATTTCTCTGCAGAAGGAAGAAAGTGAAAATTTTAAGTCCAAAGCTGAATATCTCTCTGATGTGTCAAAAGATGACGTAAGATCTTCAGGTGCAACTCTGAGCCAATTATTTTATCCTGGGCAAGTGGGATACTTCATGAATAGATTTGTATTGGATTGGAAAGTAAGCAATACAGGTTCTTGTAATGCATTGTGTACAAATGAGGTTGTTGCTGTCAAGGAGCTGGGTACAGAAGGACAATGTTTTTGCAGTTCTTGCATAACTGGGAATCAGAAGGTTGCGGAGGTGCGCCACATGATAGACACTGATTGGATATATCTTTCTAACTTGTTGAAGAAGACAGATATGGGAGACTCTCAATCACCAAGTCACCCTTCTGTAAAAACAGAGCAACTAACTGAAAAAACAACGGATTACGCTGTATTATCAGCACGTCGAGCTCTTGTGTCCTTAAAGTCTATCCCAGTTGCTGCAAGAAGAGCTATGCCTGTCCTGGTTAACGCTGAAGGAGTTCTGCTAAGCATTCCGAGCATTGGCTTCTCATGTTGCCCCCATCTGACGGTCTCTGCTGTTTTCAATCCTAGGGTACCCCTGGATGGAGGATATAGCTCATTTTTGTAG
Protein:  
MAMAGLKPHHHIAMAVSGGPDSIALCILAAGWKSNDFDAAANRRNKFIDGLLAIVVDHGLRKESAEEANLVYQRITDMGIKCEVARCEWLDGRPKVGHLQEAARNKRYQTLQNICSQLQIGILLTAHHADDQAELFILRLSRNSGILGLAGMAFTSQMFPEFPDIRGEGSKAHGIILVRPLLEFSKEDMYNICQAGYKKWVEDPTNRSPLYARNRIRMSLFNLSSPVFKAELQAVISACRRTRLHVDNVCRLLLNQAVTVMPVCSSHGYAVIDLGNLHAMEVKDIYLAKFAAMVLQFISQRHRPVRGNASKLLLSYLRTFPCKTCLTVASCYLCPAPGSKGTQVLVCCSVNSSLPPMVKLFHGCSYVRENCFAKSELEQIIKESEAYLNRLLPDASSVPFLDMASSESVLTEAKKCGILSHCTHRSIISLQKEESENFKSKAEYLSDVSKDDVRSSGATLSQLFYPGQVGYFMNRFVLDWKVSNTGSCNALCTNEVVAVKELGTEGQCFCSSCITGNQKVAEVRHMIDTDWIYLSNLLKKTDMGDSQSPSHPSVKTEQLTEKTTDYAVLSARRALVSLKSIPVAARRAMPVLVNAEGVLLSIPSIGFSCCPHLTVSAVFNPRVPLDGGYSSFL
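
The relationship between the CDS and Protein sequences above can be checked by translating the nucleotide sequence codon by codon. The sketch below assumes the standard nuclear genetic code (the record does not state a translation table), and the function name `translate` is hypothetical. For brevity it translates only the first 36 nt of the CDS; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard genetic code (assumed), built in TCAG order:
# TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

# First 36 nt of the CDS listed above.
cds_prefix = "ATGGCGATGGCTGGCCTCAAACCCCATCACCACATC"
print(translate(cds_prefix))  # MAMAGLKPHHHI
```

The output matches the first 12 residues of the Protein sequence in this record.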